Techniques for estimating vocal-tract shapes from the speech signal

نویسندگان

  • Juergen Schroeter
  • Man Mohan Sondhi
چکیده

This paper reviews methods for mapping from the acoustical properties of a speech signal to the geometry of the vocal tract that generated the signal. Such mapping techniques are studied for their potential application in speech synthesis, coding, and recognition. Mathematically, the estimation of the vocal tract shape from its output speech is a so-called inverse problem, where the direct problem is the synthesis of speech from a given time-varying geometry of the vocal tract and glottis. Different mappings are discussed: mapping via articulatory codebooks, mapping by nonlinear regression, mapping by basis functions, and mapping by neural networks. Besides being nonlinear, the acoustic-to-geometry mapping is also nonunique, i.e., more than one tract geometry might produce the same speech spectrum. We will show how this nonuniqueness can be alleviated by imposing continuity constraints.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recovering vocal tract shapes from MFCC parameters

Recovering vocal tract shapes from the speech signal is a well known inversion problem of transformation from the articulatory system to speech acoustics. Most of the studies on this problem in the past have been focused on vowels. There have not been general methods e ective for recovering the vocal tract shapes from the speech signal for all classes of speech sounds. In this paper we describe...

متن کامل

Estimation of vocal-tract shape from speech spectrum and speech resynthesis based on a generative model

Precise control of articulatory parameters is difficult and prevents a physical model from generating natural sounding speech signals. To determine vocal-tract shape from speech, this paper presents an inversion method for simultaneously estimating the cross-sectional area and length of the vocal tract. In addition, we performed speech resynthesis from a time-series of estimated vocal-tract sha...

متن کامل

Estimating the vocal-tract area function and the derivative of the glottal wave from a speech signal

We present a new method for estimating the vocal-tract area functions from speech signals. First, we point out and correct a long-standing sign error in some literature related to the derivation of the acoustic reflection coefficients of the vocal tract from a speech signal. Next, to eliminate the influence of the glottal wave on the estimation of the vocal-tract filter, we estimate the vocal-t...

متن کامل

An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping

Articulatory inversion is the problem of recovering the sequence of vocal tract shapes that produce a given acoustic speech signal. Traditionally, its difficulty has been attributed to nonuniqueness of the inverse mapping, where different vocal tract shapes can produce the same acoustics. However, evidence for the nonuniqueness has been restricted to theoretical studies, or to data from atypica...

متن کامل

Validation of Optimum Algorithm Parameters Required to Estimate Vocal Tract Shape for Children Using LPC Analysis

Severe or profound deafness in hearing impaired children, can curb their ability to speak due to the lack of auditory feedback. There has been a considerable attempt in developing commercial speech training aids for such children which give feedback of acoustic and articulatory parameters. Speech training aids based on visual feedback of vocal tract shape (VTS) are reported to be useful for the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Trans. Speech and Audio Processing

دوره 2  شماره 

صفحات  -

تاریخ انتشار 1994